Prosodic parameterization of spoken Japanese based on a model of the generation process of F0 contours

Authors

  • Hiroya Fujisaki
  • Sumio Ohno
Abstract

The process of generating an F0 contour from a small number of linguistically meaningful parameters has been modeled quite accurately, and the model has been used extensively in speech synthesis. The present study deals with the inverse problem, i.e., that of extracting the model parameters from a given contour, which can only be solved by successive approximation. This paper presents a method for deriving a first-order approximation to a given F0 contour from the linguistic information of the utterance, and refining the approximation by Analysis-by-Synthesis. The validity of the method has been confirmed experimentally.
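
The abstract does not reproduce the model equations, so the following is only a minimal sketch of the forward (generation) direction and of a toy refinement loop. It assumes the commonly cited form of the generation process model, in which ln F0(t) is the sum of a baseline ln Fb, phrase components (impulse responses scaled by command magnitudes), and accent components (differences of step responses scaled by command amplitudes). The function names, the default constants `alpha`, `beta`, and `gamma`, and the hill-climbing search in `refine_amplitudes` are illustrative assumptions, not the authors' procedure; in particular, the sketch adjusts command magnitudes only and leaves timing fixed.

```python
import math

def phrase_response(t, alpha=3.0):
    """Impulse response of the phrase control mechanism (alpha is an assumed value)."""
    return alpha * alpha * t * math.exp(-alpha * t) if t >= 0.0 else 0.0

def accent_response(t, beta=20.0, gamma=0.9):
    """Step response of the accent control mechanism, capped at gamma (assumed values)."""
    if t < 0.0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)

def f0_contour(times, fb, phrase_cmds, accent_cmds):
    """Synthesize F0 in Hz at each time (seconds).

    phrase_cmds: sequence of (onset, magnitude)
    accent_cmds: sequence of (onset, offset, amplitude)
    """
    contour = []
    for t in times:
        log_f0 = math.log(fb)
        for t0, ap in phrase_cmds:
            log_f0 += ap * phrase_response(t - t0)
        for t1, t2, aa in accent_cmds:
            log_f0 += aa * (accent_response(t - t1) - accent_response(t - t2))
        contour.append(math.exp(log_f0))
    return contour

def log_rms_error(observed, synthesized):
    """RMS difference between two contours in the log-F0 domain."""
    sq = [(math.log(o) - math.log(s)) ** 2 for o, s in zip(observed, synthesized)]
    return math.sqrt(sum(sq) / len(sq))

def refine_amplitudes(times, observed, fb, phrase_cmds, accent_cmds,
                      step=0.1, rounds=20):
    """Hypothetical Analysis-by-Synthesis loop: coordinate-wise hill climbing
    on command magnitudes only, halving the step when nothing improves."""
    phrase_cmds = [list(c) for c in phrase_cmds]
    accent_cmds = [list(c) for c in accent_cmds]
    # Each slot is (command list, command index, index of the magnitude field).
    slots = [(phrase_cmds, i, 1) for i in range(len(phrase_cmds))]
    slots += [(accent_cmds, j, 2) for j in range(len(accent_cmds))]

    def error():
        return log_rms_error(observed, f0_contour(times, fb, phrase_cmds, accent_cmds))

    best = error()
    for _ in range(rounds):
        improved = False
        for cmds, idx, field in slots:
            for delta in (step, -step):
                cmds[idx][field] += delta
                trial = error()
                if trial < best:
                    best, improved = trial, True
                    break
                cmds[idx][field] -= delta  # undo a change that did not help
        if not improved:
            step /= 2.0
    return [tuple(c) for c in phrase_cmds], [tuple(c) for c in accent_cmds], best

# Usage: synthesize a target contour, then recover magnitudes from a rough guess.
times = [i * 0.01 for i in range(200)]
target = f0_contour(times, 120.0, [(0.0, 0.5)], [(0.3, 0.8, 0.4)])
phrases, accents, err = refine_amplitudes(
    times, target, 120.0, [(0.0, 0.3)], [(0.3, 0.8, 0.2)])
```

In this sketch the rough initial guess stands in for the first-order approximation that the paper derives from linguistic information; a fuller treatment would also refine command timing and the baseline frequency.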

Related resources

Comparison of Perceived Prosodic Boundaries and Global Characteristics of Voice Fundamental Frequency Contours in Mandarin Speech

Although there have been many studies on the prosodic structure of spoken Mandarin as well as many proposals for labeling the prosody of spoken Mandarin, the labeling of prosodic boundaries in all the existing annotation systems relies on auditory perception, and lacks a direct relation to the acoustic process of prosody generation. Besides, perception-based annotation cannot ensure a high degr...

Corpus-based Generation of F0 contours of Japanese based on the Generation Process Model and its Control for Prosodic Focus

A total corpus-based process of generating prosodic features from text is developed. The process first predicts pauses and phone durations, and then generates F0 contours. Since F0 contour generation is based on the generation process model, it is rather easy to manipulate the generated F0 contours at the command level. A method was developed for generating sentence F0 contours, when a focus is pla...

Detecting accent sandhi in Japanese using a superpositional F0 model

In this report, we propose a method for automatic prosodic structure recognition of Japanese utterances based on a superpositional F0 model, focusing particularly on the accent sandhi phenomenon in compound nouns. The method enables automatic labeling of F0 contours using the model, which can be useful for creating prosodic databases containing F0 contours in a parametric form. The prosodic str...

Realization of Prosodic Focuses in Corpus-based Generation of Fundamental Frequency Contours of Japanese Based on the Generation Process Model

A method was developed for generating sentence F0 contours of Japanese when a focus is placed on one of the “bunsetsu” of an utterance. It controls F0 based on the F0 model, not by frame-by-frame F0 prediction as in HMM-based speech synthesis. The method first predicts differences in the F0 model commands between utterances with and without focus, and then applies them to the F0 model ...

Corpus-based synthesis of fundamental frequency contours of Japanese using automatically-generated prosodic corpus and generation process model

We have been developing corpus-based synthesis of fundamental frequency (F0) contours for Japanese. Since, in our method, the synthesis is done under the constraint of the F0 contour generation process model, fairly good quality is maintained even if the prediction process performs poorly. Although it was already shown that the synthesized F0 contours sounded as highly natural as those using heuri...

Publication date: 1996